Lau et al. Cell & Bioscience 201 1, 1:28 
http://www.cellandbioscience.eom/content/1/1/28 



Cell & Bioscience 



RESEARCH Open Access 



Transport genes and chemotaxis in Laribacter 
hongkongensis: a genome-wide analysis 

Susanna KP Lau 1 ' 2 ' 3 ' 4 * 1 , Rachel YY Fan 4t , Gilman KM Wong 4t , Jade LL Teng 4 Kong-Hung Sze 5 , Herman Tse 1 
Kwok-Yung Yuen 1 ' 2 ' 3 ' 4 and Patrick CY Woo 1 ' 2 ' 3 ' 4 * 



Abstract 

Background: Laribacter hongkongensis is a Gram-negative, sea gull-shaped rod associated with community- 
acquired gastroenteritis. The bacterium has been found in diverse freshwater environments including fish, frogs 
and drinking water reservoirs. Using the complete genome sequence data of L hongkongensis, we performed a 
comprehensive analysis of putative transport-related genes and genes related to chemotaxis, motility and quorum 
sensing, which may help the bacterium adapt to the changing environments and combat harmful substances. 

Results: A genome-wide analysis using Transport Classification Database TCDB, similarity and keyword searches 
revealed the presence of a large diversity of transporters (n = 457) and genes related to chemotaxis (n = 52) and 
flagellar biosynthesis (n = 40) in the L hongkongensis genome. The transporters included those from all seven 
major transporter categories, which may allow the uptake of essential nutrients or ions, and extrusion of metabolic 
end products and hazardous substances. L hongkongensis is unique among closely related members of 
Neisseriaceae family in possessing higher number of proteins related to transport of ammonium, urea and 
dicarboxylate, which may reflect the importance of nitrogen and dicarboxylate metabolism in this assacharolytic 
bacterium. Structural modeling of two C 4 -dicarboxylate transporters showed that they possessed similar structures 
to the determined structures of other DctP-TRAP transporters, with one having an unusual disulfide bond. Diverse 
mechanisms for iron transport, including hemin transporters for iron acquisition from host proteins, were also 
identified. In addition to the chemotaxis and flagella-related genes, the L hongkongensis genome also contained 
two copies of qseB/qseC homologues of the AI-3 quorum sensing system. 

Conclusions: The large number of diverse transporters and genes involved in chemotaxis, motility and quorum 
sensing suggested that the bacterium may utilize a complex system to adapt to different environments. Structural 
modeling will provide useful insights on the transporters in L hongkongensis. 



Background 

Laribacter hongkongensis is a Gram-negative, sea gull- 
shaped, rod that belongs to the Neisseriaceae family of 
P-proteobacteria [1,2]. The bacterium was first isolated 
from the blood and empyema pus of a man with alco- 
holic cirrhosis and bacteremic empyema thoracis in 
Hong Kong [1]. Using the selective medium, cefopera- 
zone MacConkey agar, the bacterium was subsequently 
isolated from the stool of patients with gastroenteritis 
[3,4] . In a multicenter case-control study, L. hongkongen- 
sis was shown to be associated with community- acquired 
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gastroenteritis, with recent travel and eating fish being 
risk factors [5], Apart from the human gut, L. hongkon- 
gensis has also been isolated from gut of freshwater ani- 
mals including fish and Chinese tiger frogs as well as 
water from drinking water reservoirs [2,5-9]. In order to 
adapt to the changing environments and intestines of dif- 
ferent animal hosts including human, fish and amphi- 
bians, L. hongkongensis must possess mechanisms to 
combat harmful substances in the environment and 
immune defense of animal hosts. 

Transport-related proteins of bacteria are important in 
allowing the uptake of essential nutrients or ions, and 
extrusion of metabolic end products and hazardous sub- 
stances. Bacteria employ different mechanisms for trans- 
port of different chemicals and these mechanisms have 
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been classified into seven major categories according to 
the Transport Protein Database (TCDB): channels and 
pores (class 1), electrochemical potential-driven transpor- 
ters (class 2), primary active transporters (class 3), group 
translocators (class 4), transmembrane electron carriers 
(class 5), accessory factors involved in transport (class 8), 
and incompletely characterized transport systems (class 9). 

Bacteria also possess sophisticated signaling systems to 
sense and adapt to various substances in the environ- 
ment. Depending on whether the environmental sub- 
stances are attractants or repellents, the bacterium may 
migrate towards or away from the substances, which 
include certain amino acids, sugars, and metal ions 
[10-12]. This sense-and-swim ability is important for 
bacteria to be able to find the suitable environment for 
optimal growth. Chemotaxis involves two separate sys- 
tems, the chemoreceptors located in the bacterial cell 
membrane which are important for sensing the binding 
compounds, and the transduction proteins which are 
involved in the downstream signal transduction in 
response to the stimuli. The chemoreceptors are also 
called methyl-accepting chemotaxis proteins (MCPs), 
which are reversibly methylated and function as homo- 
dimers [11,13]. 

The availability of the complete genome sequence of 
L. hongkongensis has allowed an opportunity to study its 
biology and important factors for adaptation to the chan- 
ging environment [14]. We have previously found that 
transport-related proteins, including all seven major cate- 
gories of transporters, account for about 14.1% of all cod- 
ing sequences in the L. hongkongensis genome, suggesting 
that this group of proteins may be important for survival 
of the bacterium in the various environments and hosts 
[14]. Genes related to motility and chemotaxis were also 
identified [14]. Except for the first strain isolated from 
blood culture and empyema pus of a patient which was 
likely a non-motile variant, all strains from patients with 
gastroenteritis, animals or environmental water samples 
are motile with polar flagellae [1,4-7,10], suggesting 
that chemotaxis and motility may be an important 
mechanism for environmental adaptation in most isolates 
of L. hongkongensis. In this study, a comprehensive analysis 
of putative transport-related genes and genes related 
to chemotaxis, motility and quorum sensing in the 
L. hongkongensis genome is performed. 

Results and discussion 

Transport genes in L. hongkongensis genome 

A huge diversity of transporters, including those from all 
seven major categories, were identified in the L. hongkon- 
gensis genome, as described in our previous complete gen- 
ome report [14]. This may reflect its ability to adapt to 
various environments, including freshwater animals, water 
and human intestines. These transporters included: (1) 48 



channels and pores, (2) 134 electrochemical potential- 
driven transporters, (3) 194 primary active transporters, 
(4) 9 group translocators, (5) 16 transmembrane electron 
carriers, (6) 7 accessory factors involved in transport and 
(7) 49 transporters of incompletely characterized transport 
systems (Table 1). 
Channels and pores 

The outer membranes of lipid bilayer envelopes of 
Gram-negative bacteria contain large numbers of water- 
filled transmembrane protein channels known as porins 
[15]. They serve as a molecular filter allowing for per- 
meation of hydrophilic molecules up to a certain size or 
specific solutes into the periplasmic space. Some bacter- 
ial porins also serve as receptor for phage and bacterio- 
cin binding [16]. X-ray crystoallography studies and 
atomic structures have revealed that porin molecules 
exists as trimers, with the transmembrane core com- 
posed of mostly P -sheets and some a-helixes [15]. The 
L. hongkongensis genome contained 48 coding sequences 
(CDSs) belonging to channels and pores, of which 17 
were a-type channels, 29 were p -barrel porins and 2 
were holins (Table 1). 

Among the 17 a-type channels, five were mechanosensi- 
tive channels, including one large conductance mechano- 
sensitive channel (LHK_02562) and four small 
conductance mechanosensitive channels (LHK_01830, 
LHK_01942, LHK_02394 and LHK_02965), which are 
responsible for mediating resistance to mechanophysical 
changes [17]. Interestingly, three CDSs encoding proteins 
of the ammonium transporter family were identified in the 
L. hongkongensis genome, as compared to only one copy 
such genes in Chromobacterium violaceum, the most clo- 
sely related bacterial species of the Neisseriaceae family 
with complete genome sequence available (Table 2). 
Moreover, a homologue of urea transporter responsible 
for urea uptake (LHK_01044) was also present in L. hon- 
gkongensis (Table 2), while this protein was absent in 
C. violaceum and the pathogenic Neisseria spp., Neisseria 
gonorrhoeae and Neisseria meningitidis. This may reflect 
the importance of nitrogen metabolism of the bacterium, 
as L. hongkongensis is assacharolytic and has been shown 
to use different pathways for arginine synthesis regulated 
at different temperatures [14]. In fact, the habitats of the 
closely related bacterial species are quite different from 
that of L. hongkongensis, where the latter can survive in 
human intestine in addition to diverse freshwater environ- 
ment. This may also explain its unique ability in maximiz- 
ing nitrogen metabolism. Among the P -barrel porins, the 
OmpA-OmpF-type porins are most well known in bac- 
teria to allow passive diffusion of hydrophilic substrates 
across the outer membrane. Three CDSs coding for puta- 
tive OmpA-OmpF-type porins were identified in the 
L. hongkongensis genome. Interestingly, two homologues 
of another p -barrel porin, fatty acid transporter gene 
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Table 1 Transporters in L hongkongensis and C. violaceum 



Category 
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C. violaceum 
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Phosphotransfer-driven group translocators 


2 






8 






Acyl CoA ligase-coupled transporters 


7 






10 






Transmembrane Electron Carriers 


16 


0.5 


3.5 


13 


0.3 


2.3 


Transmembrane 2-electron transfer carriers 


14 






12 






Transmembrane 1 -electron transfer carriers 


2 






1 






Accessory Factors Involved in Transport 


7 


0.2 


1.5 


20 


0.5 


3.6 


Auxiliary transport proteins 


7 






20 






Incompletely Characterized Transport Systems 


49 


1.5 


10.7 


33 


0.7 


5.9 


Recognized transporters of unknown 
biochemical mechanism 


15 






14 






Putative transport proteins 


34 






19 







Table 2 a-type channels in L hongkongensis and their closest homologues 


CDS 


Protein 


Closest match organism 


Best E- 
value 


Amino acid identity 

(%) 


LHK_02933 


Ammonium transporter 


L nitroferrum 


2.00E-146 


73.18 


LHK_03249 


Ammonium transporter 


Shewanello holifoxensis 


2.00E-118 


62.32 


LHK_03154 


Ammonium transporter family protein 


L nitroferrum 


1.00E-163 


78.99 


LHK_02207 


Flagellar motor protein MotA 


L nitroferrum 


1.00E-122 


74.48 


LHK_00970 


Ion transporter 


C. violaceum 


5.00E-78 


58.96 


LHK_02562 


Large-conductance mechanosensitive channel 


Pelodictyon luteolum 


2.00E-43 


56.95 


LHK_01830 


Transmembrane protein 


C violaceum 


2.00E-109 


57.52 


LHK_01942 


Mechanosensitive ion channel protein 


Janthinobacterium sp. Marseille 


5.00E-79 


41.26 


LHK_02394 


MscS Mechanosensitive ion channel 


L nitroferrum 


7.00E-55 


48.95 


LHK_02965 


Transporter, small conductance mechanosensitive ion channel 
family 


E. coli 0157:H7 


5.00E-73 


61.04 


LHK_02739 


Molecular chaperone DnaK 


C. violaceum 


0 


85.98 


LHK_02206 


OmpA/MotB domain protein 


L nitroferrum 


4.00E-97 


75.46 


LHK_01044 


Urea transporter 


Methylobacterium extorquens 
PA1 


1 .00E-65 


50.46 


LHK_00053 


TolQ-related transport transmembrane protein 


C. violaceum 


1 .00E-86 


74.66 


LHK_03174 


ToIR protein 


C. violaceum 


5.00E-30 


51.88 


LHK_00499 


Probable exbB-like biopolymer transport 


C. violaceum 


4.00E-55 


59.31 


LHK_00498 


Biopolymer transport exbD transmembrane protein 


Burkholderia pseudomalleil 1 2 


7.00E-36 


55.88 
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(fadL), were also found, which may be important for 
uptake of long-chain fatty acids in freshwater environ- 
ments poor in lipids or fatty acids. 
Electrochemical potential-driven transporters 
The L. hongkongensis genome possessed a large number of 
CDSs (n = 134) encoding for putative electrochemical 
potential-driven transporters, among which the majority 
(132 CDSs) were porters including uniporters, symporters 
and antiporters, while the remaining two CDSs were ion- 
gradient-driven energizers (Table 1). Of the 132 porters, 
19 (14.3%) belonged to the major facilitator superfamily 
(MFS). MFS proteins are important transporters in bac- 
teria, which allow transport of molecules by an electroche- 
mical ion gradient and typically contain a single subunit 
with 12 membrane-spanning helixes [18]. The MFS pro- 
teins of L. hongkongensis were predicted to mediate trans- 
port of diverse substrates including ions, drugs and 
metabolites. Another major family of porters were the 
resistance-nodulation-cell division (RND) superfamily (28 
CDSs), which are responsible for transporting a wide vari- 
ety of substrates including antibiotics, dyes, detergents, 
fatty acids, bile salts, organic solvents, heavy metals, auto- 
inducers and lipooligosaccharides in Gram-negative bac- 
teria [19,20]. Other porters belonged to diverse families of 
proteins which facilitate the transport of diverse sub- 
stances including ions, amino acids, drugs, heavy metal 
such as nickel and cobalt, nucleobase, C 4 -dicarboxylates 
and other metabolites. The presence of various porters 
may be involved in acquisition of essential substances for 
metabolism and bacterial resistance to environmental 
toxic substances including heavy metals. Interestingly, a 
total of 11 porters for dicarboxylate transport were found 
in L. hongkongensis genome, as compared to only 6 in 
C. violaceum and 1 each in N. meningitidis and N. gonor- 
rhoeae genomes (Table 3). C 4 -dicarboxylates are inter- 
mediates in TCA cycle that can be utilized by bacteria as 
nonfermentable carbon and/or energy sources under aero- 
bic or anaerobic conditions [21]. Some C 4 -dicarboxylates, 
such as succinate, oxalate and malate, can also be found in 
nature [22]. The presence of high number of C 4 -dicarbox- 
ylates transporters may reflect the ability of using 
C 4 -dicarboxylates as carbon sources in L. hongkongensis, 
as the bacterium is assacharolytic, lacking a complete gly- 
colytic pathway, and is in line with our experiments 



showing that L-malate can be used as its sole carbon 
source [14]. 

Six of the 11 porters for dicarboxylate transport found in 
L. hongkongensis genome were believed to form two DctP- 
type tripartite ATP-independent periplasmic (TRAP) 
transporters which belong a heterogeneous group of sub- 
strate-binding protein (SBP) -dependent secondary trans- 
porters of a diverse range of substrates found in bacteria 
and archaea [23-25]. The genes encoding the 3 subunits 
were arranged in an operon, with two membrane proteins 
DctQ and DctM associating with DctP to form a C 4 -dicar- 
boxylate TRAP transporter [26]. Several TRAP transpor- 
ters have been characterized in detail, with the structures 
of at least seven DctP-type SBP subunits determined [25]. 
These studies revealed significant structural and architec- 
tural similarities among the different SBPs, while high- 
lighting the differences that permitted these proteins to 
bind their respective substrates with high affinity and spe- 
cificity. Besides substrate recognition, it was also found 
that the SBP performs other essential functions [27], and 
likely interacts with the integral membrane components in 
a hitherto undiscovered manner. One operon 
(LHK_00983-00984-00985), encoding C 4 -dicarboxylate 
transporter, was found downstream of several genes 
related to allantoin regulation and utilization; while the 
other operon (LHK_01394-01393-01392) was located 
upstream of the maeB gene encoding NADP-dependent 
malate dehydrogenase. The SBP encoded by LHK_00983 
(DctP_00983) was a 331 aa protein containing a 22 aa N- 
terminal signal peptide, with a predicted molecular weight 
of 33.9 kDa. It possessed 48% amino acid identity to the 
closest homolog in Roseovarius sp. TM1035 (NCBI acces- 
sion no.: ZP_01881277). The SBP encoded by LHK_01394 
(DctP_01394) was a 335 aa protein containing a 24 aa N- 
terminal signal peptide, with a predicted molecular weight 
of 34.3 kDa. It possessed 74% amino acid identity to the 
closest homolog in C. violaceum ATCC12472. The homol- 
ogy model and structural alignment of the homology 
model showed that the overall structure of DctP_00983 
and DctP_01394 was very similar to the determined struc- 
tures of other DctP-type SBPs (Figure 1 and 2, and see 
Supplementary material). Similar to other DctP homologs, 
they were divided into two domains with conserved 
arrangements of a-helices and P -sheets, which are 



Table 3 Porters for dicarboxylates in L. hongkongensis and related bacteria 



Family 


L hongkongensis 


G violaceum 


N. meningitidis 


N. gonorrhoeae 


C4-Dicarboxylate Uptake (Dcu) Family 


0 


2 


0 


0 


Dicarboxylate/Amino AcidCation (Na or H) Symporter (DAACS) Family 


3 


1 


0 


0 


Tripartite ATP-independent Periplasmic Transporter (TRAP-T) Family 


6 


3 


0 


0 


Divalent Anion:Na+ Symporter (DASS) Family 


1 


0 


1 


1 


C 4 -dicarboxylate Uptake C (DcuC) Family 


1 


0 


0 


0 


Total 


11 


6 


1 


1 
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Figure 1 Homology model of DctP_00983 (panel A) and DctP_01394 (panel B), putative DctP TRAP transporters for C 4 -dicarboxylate 

in L hongkongensis. For DctP_00983, the C-score of the model was 1.49, which approximately corresponded to an expected TM-score of 0.92 
± 0.06 and an expected root-mean-square deviation (RMSD) of 3.2 ± 2.3 A from the native structure. The Ramachandran plot showed that 99.6% 
of aa are in the favored and allowed regions. Calculated G-factors for dihedral angles and main-chain covalent forces are 0.1 1 and -0.17 
respectively, with an overall average of 0.01. The Z-score of the model is -7.84, which is comparable to other experimentally determined protein 
chains of a similar size in the PDB. Local model quality analysis by plot of residue scores in ProSA-web did not reveal any problematic regions in 
the structure. The quality analysis results suggested that the homology model is mostly reliable with good structural qualities. For DctP_01394, 
the C-score of the model was 1.36, which approximately corresponded to an expected TM-score of 0.90 ± 0.06 and an expected RMSD of 3.5 ± 
2.4 A from the native structure. The Ramachandran plot showed that 99.0% of aa are in the favored and allowed regions. Calculated G-factors 
for dihedral angles and main-chain covalent forces are 0.09 and -0.17 respectively, with an overall average of 0.00. The Z-score of the model is 
-8.15, which is comparable to other experimentally determined protein chains of a similar size in the PDB. Local model quality analysis by plot of 
residue scores in ProSA-web did not reveal any problematic regions in the structure. The quality analysis results suggested that the homology 
model of DctP_01394 is also reliable with good structural qualities. 



connected by a characteristic hinge made up of two (3- 
strands and an a -helix. A highly conserved arginine resi- 
due in domain II is present in both proteins (Argl45 of 
DctP_00983 and Argl47 of DctP_01394), which 



corresponds to Argl47 in SiaP of H. influenzae essential 
to SBP function by forming a salt bridge with the carboxy- 
late group of the ligand [28]. Interestingly, a disulfide bond 
was predicted between the cysteine residues at positions 




Figure 2 Structural alignment of the homology model of DctP_00983 and DctP_01394, showing similar structures to other DctP-type 
SBPs (panel A) and a disulfide bond predicted between the cysteine residues at positions 129 and 182 of DctP_00983 (panel B). RMSD 
between DctP_00983 and the related structures ranged from 0.761 to 1.290 A. RMSD between DctP_01394 and the related structures ranged 
from 0.891 to 1.377 A. 
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129 and 182 for DctP_00983 (Figure 2) by homology mod- 
eling and sequence analysis. This structural feature was 
also found in the closest homolog in Roseovarius sp. 
TM1035, but absent from other related DctP-type SBP 
homologs including DctP_01394. 
Primary active transporters 

Primary active transporters mediate energy-driven trans- 
port of substances in and out of bacterial cells by using 
ATP hydrolysis, photon absorption, electron flow, sub- 
strate decarboxylation, or methyl transfer [29]. Primary 
active transporters were the most abundant class of 
transporters (194 CDSs), constituting 6% of CDSs in the 
L. hongkongensis genome, among which 150 belonged to 
P-P-bond-hydrolysis-driven transporters (Table 1). Of 
the 150 P-P-bond-hydrolysis-driven transporters, 109 
were ATP-binding cassette (ABC) transporters which are 
one of the largest groups of membrane proteins using 
energy from ATP hydrolysis for transport. In bacteria, 
they reside in the inner membrane and are involved in 
both uptake and export of a wide range of substances. All 
ABC transporters share a common basic structure which 
consists of four domains: two transmembrane domains, 
typically with six transmembrane spans per domain, and 
two cytoplasmic nucleotide-binding domains which cata- 
lyse nucleotide hydrolysis [30]. In bacteria, these domains 
are encoded as separate polypeptides. Determined by the 
structure of the transmembrane domain, ABC transpor- 
ters are typically specific for the substrates that they are 
responsible for, although some may transport for multi- 
ple related substances. As a result, the numbers of ABC 
transporters in different bacterial species vary widely, 
depending on its need for adaptation to varying environ- 
mental conditions [31]. The ABC transporters in the 
L. hongkongensis are likely involved in the active trans- 
port of diverse substances, including carbohydrate, amino 
acids or peptides, ions, vitamins, lipids, drugs and heavy 
metals including molybdenum, iron, zinc, cobalt, magne- 
sium, copper, cadmium, mercury, lead, arsenite and 
nickel. These systems were often arranged in gene clus- 
ters comprising the ATP-binding protein and two auxili- 
ary proteins, a permease and a substrate-binding protein. 
Compared to the 70 ABC transporters found in E. coli 
[31], the L. hongkongensis genome contained a large 
number of such proteins, reflecting its ability to adapt to 
different hosts and environment. 

Apart from P-P-bond-hydrolysis-driven transporters, 
other primary active transporters identified in the 
L. hongkongensis genome included oxidoreduction-dri- 
ven transporters (39 CDSs) and decarboxylation-driven 
transporters (5 CDSs), which use chemical energy to 
perform transport of charged or uncharged molecules 
across the membrane against the concentration gradient 
[32]. 



Group translocators 

Of the nine group translocators, two were phosphotrans- 
fer-driven group translocators and seven were acyl CoA 
ligase-coupled transporters belonging to the fatty acid 
transporter (FAT) family. The phosphotransferase group 
translocators are components of the bacterial phospho- 
transferase system (PTS), which catalyzes translocation of 
sugars and hexitols with concomitant phosporylation, and 
regulates the metabolism in response to the availability of 
carbohydrates. PTSs consist of two cytoplasmic proteins, 
enzyme I (EI) and HPr, and a variable number of sugar- 
specific transport complexes (Enzymes II sugar ) belonging to 
the group translocators. While the Escherichia coli genome 
encoded 38 different PTS proteins, the L. hongkongensis 
genome encoded only one gene for EI and HPr each and 
two genes for transporters, one containing protein-N p- 
phosphohistidine-sugar phosphotransferase IIA domain 
and the other containing nitrogen-regulatory fructose- 
specific IIA domain [33] . This is likely related to the rela- 
tive unimportance of sugar metabolism in L. hongkongensis. 
Transmembrane electron carriers 

There were 16 transmembrane electron carriers in the 
L. hongkongensis genome, including 14 transmembrane 
2- and two transmembrane 1-electron transfer carriers. 
Among the 14 transmembrane 2-electron transfer car- 
riers, 12 belonged to the prokaryotic molybdopterin-con- 
taining oxidoreductase (PMO) family, and the other 2 
belonged to the disulfide bond oxidoreductase D (DsbD) 
and B (DsbB) family respectively. 
Accessory factors involved in transport 
There were seven accessory factors belonging to auxili- 
ary transport proteins in the L. hongkongensis genome, 3 
belonging to the membrane fusion protein (MFP) family, 
2 to the phosphotransferase system enzyme I (EI) family, 
1 to the phosphotransferase system HPr (HPr) family 
and 1 to the stomatin/podocin/band 7/nephrosis.2/ 
SPFH (stomatin) family. 
Incompletely characterized transport systems 
Of the 49 CDSs belonging to incompletely characterized 
transport system, 15 were recognized transporters of 
unknown biochemical mechanism, with 6 belonging to the 
putative type VI symbiosis/virulence secretory pathway 
(VISP) family, 2 to the HlyC/CorC (HCC) family, 2 to the 
capsular polysaccharide exporter (CPS-E) family, 1 to the 
tellurium ion resistance (TerC) family and the remaining 4 
being metal ion transporters. The other 34 CDSs were 
putative transport proteins, including 2 CDSs of the cam- 
phor resistance (CrcB) family and 1 probable hemolysin 
III. 

Iron Transport in L. hongkongensis 

Iron is an essential metal for most microorganisms used in 
many key molecules involved in metabolism. In bacteria, 
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iron metabolism has been shown to be important in adap- 
tation to the environment especially within the host and 
as a result related to virulence. Diverse mechanisms for 
iron transport were identified in the L. hongkongensis gen- 
ome, suggesting that the bacterium is able to adapt to iron 
limitation present in human body which represents one of 
the non-specific immune response called induced hypofer- 
remia [34,35] 

Siderophores and iron uptake 

Siderophores are low molecular mass compounds with 
high affinity for ferric iron. In contrast to C. violaceum 
which produced siderophores for iron acquisition, pro- 
teins related to siderophore production were not found 
in L. hongkongensis genome. However, a homolog of 
TonB-dependent siderophore receptor (LHK_00497) was 
present, as described in our previous report [14]. 
Although Listeria monocytogenes also did not produce 
siderophores for iron acquisition, it was able to obtain 
iron by using either exogenous siderophores produced by 
various microorganisms or natural catechol compounds 
widespread in the environment [36,37]. It remains to be 
determined if L. hongkongensis can utilize exogenous 
siderophores or other natural iron-binding compounds 
for iron acquisition. 
Hemin transport 

Despite the inability to produce siderophores, a set of 
genes related to the transport of hemin were identified in 
L. hongkongensis genome (8 CDSs compared to 6 CDSs 
in C. violaceum). The 8 CDSs included TonB-dependent 
receptor (LHK_01193), hemin degrading factor (LHK_ 
01192), ABC transporter permease (LHK_01189), ferric 
citrate transport system ATP-binding protein (LHK_ 
01188), hemin-binding periplasmic protein (LHK_01190), 
hemin importer ATP-binding subunit (LHK_01427), 
hemin ABC transporter permease protein (LHK_01428) 
and Fur family ferric uptake regulator (LHK_01431). The 
conserved domains for hemin receptor, FRAP and 
NPNL, were also identified in the TonB-dependent 
receptor [38]. This suggests that L. hongkongensis is able 
to utilize iron source form host proteins, which may be 
important for survival in its hosts. Three other CDSs, 
homologous to fbpA (LHK_02634), fbpB (LHK_02635) 
and ATP-binding protein (LHK_02636), ABC transpor- 
ters for transferrin and lactoferrin, were also present, 
although the outer membrane receptor is not found. 
ABC transporters of the metal type 

A cluster of three genes encoding an ABC transporter of 
the metal type (homologous to that identified in C. viola- 
ceum) was identified in the L. hongkongensis genome. 
They encoded a periplasmic Mn 2+ /Zn 2+ -binding (lipopro- 
tein (surface adhesion A) (znuA), a Mn 2+ /Zn 2+ permease 
component (znuB) and the ATPase component (znuC). In 
addition, a gene encoding a putative cadmium-translocat- 
ing ATPase component (cadmium-translocating P-type 



ATPase) (CadA) (LHK_00449) was also present. A similar 
gene was also found in C. violaceum (CV1154), which was 
thought to be a surface adhesion A component for Mn 2 
+ /Zn 2+ binding. The Fur family ferric uptake regulator 
(zur) (LHK_01344) was also present. 
Other transporters 

In addition to the above transporters, two CDSs encod- 
ing ferrous iron transport proteins, feoA (LHK_03044) 
and feoB (LHK_03045), were identified in L. hongkon- 
gensis genome, which are believed to provide iron sup- 
ply under anaerobic or low pH conditions in bacteria 
[39]. Three other CDSs homologous to iron uptake 
ABC transporter periplasmic solute-binding protein 
(LHK_01590), ABC transporter permease (LHK_01593) 
and ABC transporter ATP-binding protein (LHK_01591) 
were also found. 
Iron storage 

Mechanism required for storage of iron after its acquisi- 
tion from the environment was present in L. hongkongen- 
sis, which mainly depends on two proteins: bacterioferritin 
(BFR) (LHK_01239, homologous to CV3399 in C. viola- 
ceum) and frataxin-like homolog (LHK_00023, homolo- 
gous to Daro_0208 in Dechloromonas aromatica). The 
BFR is an iron-storage protein with close similarity to the 
ferritins found in both eukaryotes and prokaryotes [40]. 
The frataxin-like homolog has been implicated in iron sto- 
rage in other bacteria. The frataxin-like domain is related 
to frataxin, the protein mutated in Friedreich's ataxia 
which is therefore proposed to result from decreased 
mitochondrial iron storage [41,42]. 
Regulation of iron transport 

Fur protein is a global repressor protein by forming Fur- 
Fe 2+ complexes that bind to iron-dependent promoter 
during iron-rich conditions. It regulates ferrichrome 
(fhuABCDG), ferric citrate (fecABCDE) and ferrous iron 
(feoABC) uptake systems. The Fur protein in L. hongkon- 
gensis was encoded in CDS LHK_01431 (homologous to 
FuraDRAFT_2340 in Lutiella nitroferrum). 

Chemotaxis in L. hongkongensis 

Methyl-accepting chemotaxis and chemosensory transducer 
proteins 

A total of 52 open reading frames (CDSs) were related to 
chemotaxis, of which 29 encoded MCPs and 22 were che- 
mosensory transducer proteins. Most genes encoding 
MCPs were scattered throughout the L. hongkongensis 
genome, while the genes encoding transducer proteins 
were mostly arranged in three gene clusters as described 
in our previous report (Table 4) [14]. 

All the predicted MCPs in L. hongkongensis possessed 
a transmembrane domain, which is compatible with 
their anticipated location in the bacterial cell membrane 
and function as receptors. Conserved domain structures 
were also identified in some of the MCPs. The plasmid 
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Table 4 CDSs related to chemotaxis in L. hongkongensis genome 



CDS 


Gene 


Product 


Organism with the 
closest matching 
sequences 


E-value 


Identities 


Clust< 


LHK_001 15 




histidine kinase, HAMP region: chemotaxis sensory transducer 


D. oromotico 


1e-96 


242/680 (35%) 




LHK_00482 




methyl-accepting chemotaxis sensory transducer 


L. nitroferrum 


4e-55 


1 64/543(30%) 




LHK 00516 




methyl-accepting chemotaxis sensory transducer 


L. nitroferrum 


8e-129 


265/513 (51%) 




LHK 00553 




diguanylate phosphodiesterase 


C. violoceum 


6e-1 1 1 


21 1/406 (51%) 


CA 


LHK 00554 


cheAl 


CheA signal transduction histidine kinase 


L. nitroferrum 


0 


443/613 (72%) 


CA 


LHK_00555 


cheZl 


chemotaxis phosphatase, CheZ 


L. nitroferrum 


2e-69 


139/244(59%) 


CA 


LHK_00556 


cheYl 


chemotaxis regulator protein CheY 


C. violaceum 


4e-61 


109/130 (83%) 


CA 


iiii/ r\r\r~ \ — 7 

LHK_00557 


cheVl 


chemotaxis protein CheV 


C. violoceum 


1 e-1 38 


240/314 (76%) 


CA 


LHK_00558 


cheV2 


chemotaxis protein CheV 


C violoceum 


5e-147 


251/313 (80%) 


CA 


LHK_00559 




two-component sensor histidine kinase 


L nitroferrum 


2e-59 


169/381 (44%) 


CA 


LHK_00560 




chemotaxis sensory transducer 


D. oromotico 


6e-24 


100/320 (31%) 


CA 


LHK_00561 


cheY2 


chemotaxis protein cheY 


D. oromotico 


8e-46 


85/121 (70%) 


CA 


LHK_00562 


cheA2 


chemotaxis protein CheA 


C. violoceum 


2e-161 


358/746 (47%) 


CA 


LHK_00563 


cheW 


CheW protein 


Burkholderio 
phytofirmons 


1e-40 


95/153 (62%) 


CA 


LHK_00564 




methyl-accepting chemotaxis protein 


C. violoceum 


4e-143 


315/475 (66%) 


CA 


LHK_00565 


cheR 


CheR chemotaxis protein methyltransferase 


Jonthinobocterium 
sp. Marseille 


5e-68 


125/273 (45%) 


CA 


LHK_00566 


cheBl 


chemotaxis-specific methylesterase 


Nitrosomonos 
europoeo 


2e-99 


186/355 (52%) 


CA 


LHK_00567 


cheD 


chemoreceptor glutamine deamidase CheD 


D. oromotico 


5e-59 


108/189 (57%) 


CA 


LHK_00603 




methyl-accepting chemotaxis protein 


C. violoceum 


7e-103 


242/624 (38%) 




LHK_00617 




methyl-accepting chemotaxis protein IV 


C violoceum 


2e-100 


223/481 (46%) 




LHK_00700 




methyl-accepting chemotaxis sensory transducer 


Allochromotium 
vinosum 


0 


384/715 (53%) 




1 1 11/ r\r\~i ^\ r 

LHK_00726 


oerl 


methyl-accepting chemotaxis sensory transducer with Pas/Pac sensor 


L nitroferrum 


7e-1 14 


232/528 (43%) 




LHK_00935 


cheR 


MCP methyltransferase, CheR-type 


L nitroferrum 


2e-92 


1 70/282 (60%) 




LHK_01020 




putative aromatic hydrocarbon chemotaxis transducer 


Azoorcus sp. 


4e-62 


140/338 (41%) 




1111/ Alii/' 

LHK_01 1 16 




methyl-accepting chemotaxis protein 


Denitrovibrio 
ocetiphilus 


1 e-59 


152/461 (32%) 




LHK_01212 




methyl-accepting chemotaxis sensory transducer 


L nitroferrum 


1 e-1 35 


261/476 (54%) 




LHK_01359 


cheY3 


chemotaxis regulator protein CheY 


C. violoceum 


1 e-56 


1 02/1 27 (80%) 


CB 


LHK_01360 


cheV3 


chemotaxis protein CheV 


C. violoceum 


1 e-1 34 


231/309 (74%) 


CB 


1111/ r\ 1 t r 1 

LHK_0l36l 




methyl-accepting chemotaxis sensory transducer 


L nitroferrum 


6e-47 


157/506 (31%) 


CB 


1 1 11/ A1 

LHKJJ1372 




chemotaxis sensory transducer 


D. oromotico 


4e-49 


166/534 (31%) 




1111/ r\ 1 a ~7 r\ 

LHK_01470 




putative aromatic hydrocarbon chemotaxis transducer 


Azoorcus sp. 


2e-93 


222/539 (41%) 




LHK_01602 




methyl-accepting chemotaxis sensory transducer 


L. nitroferrum 


0 


339/601 (56%) 




LHK_01618 




methyl-accepting chemotaxis sensory transducer 


L. nitroferrum 


2e-87 


209/525 (39%) 




LHK_01706 




methyl-accepting chemotaxis protein IV 


C violoceum 


1 e-1 21 


247/481 (51%) 




LHK_01721 




methyl-accepting chemotaxis protein 


C violoceum 


4e-1 13 


240/627 (38%) 




LHK_02037 




methyl-accepting chemotaxis sensory transducer 


Leptospirillum 
ferrodiozotrophum 


5e-63 


137/327 (41%) 




LHK_02158 


aerl 


methyl-accepting chemotaxis sensory transducer with Pas/Pac sensor 


Rolstonio pickettii 


6e-39 


98/276 (35%) 




LHK_02165 




methyl-accepting chemotaxis protein 


C violoceum 


8e-146 


275/631 (43%) 




LHK_02364 


cheB2 


response regulator receiver modulated CheB methylesterase 


Geobocter 
bemidjiensis 


le-63 


122/206 (59%) 




LHK_02427 




methyl-accepting chemotaxis protein 


C violoceum 


6e-1 10 


227/629 (36%) 


CC 


LHK_02428 




Hypothetical protein 


No 






CC 


LHK_02429 


cheV4 


response regulator receiver modulated CheW protein 


L nitroferrum 


2e-145 


248/313 (79%) 


CC 


LHK_02430 


cheV5 


chemotaxis protein CheV 


C. violoceum 


3e-137 


237/314 (75%) 


CC 


LHK_02431 


cheY4 


chemotaxis regulator protein CheY 


C. violoceum 


2e-58 


105/127 (82%) 


CC 


LHK_02432 


cheZ2 


chemotaxis phosphatase, CheZ 


L nitroferrum 


2e-63 


129/248 (52%) 


CC 
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Table 4 CDSs related to chemotaxis in L hongkongensis genome (Continued) 



LHK 


_02433 cheA3 


CheA signal transduction histidine kinase 


L nitroferrum 


0 


420/61 1 


(68%) CC 


LHK 


_02455 


methyl-accepting chemotaxis sensory transducer 


Condidotus 

Accumulibacter 

phosphates 


1e-74 


154/326 


(47%) 


LHK 


_02575 


putative Methyl-accepting or sensory transducer chemotaxis protein 


Alteromonadales 
bacterium 


le-83 


1 72/407 


(42%) 


LHK 


_02814 aer3 


chemotaxis sensory transducer 


Rhodopseudomonas 
palustris 


8e-42 


138/425 


(32%) 


LHK 


_02834 


methyl-accepting chemotaxis protein 


Pseudomonas 
syringae 


1e-45 


148/437 


(33%) 


LHK 


_03026 


methyl-accepting chemotaxis protein 


C. violaceum 


6e-145 


275/627 


(43%) 


LHK 


_03119 


methyl-accepting chemotaxis sensory transducer 


L nitroferrum 


2e-133 


273/514 


(53%) 


LHK 


_03163 


methyl-accepting chemotaxis sensory transducer 


Candidatus 

Accumulibacter 

phosphatis 


9e-50 


167/494 


(33%) 



a The Che proteins were encoded in three gene clusters, named CA, CB and CC (chemotaxis A, B and C clusters) 



achromobacter secretion (PAS) domain was found in 
four MCPs (LHK 00564, LHK.00726, LHK 02158 and 
LHK_02814). PAS domains are energy-sensing modules 
that are found in proteins from archaea to humans [43]. 
The histidine kinase adenylyl cyclase MCP and phospha- 
tase (HAMP) domain was present in 22 of the 29 MCPs. 
The HAMP domain interacts with the PAS domain for 
signal transduction in aerotaxis (oxygen-sensing) recep- 
tor in Escherichia coli [43], and possesses roles of regulat- 
ing the phosphorylation or methylation of homodimeric 
receptors by transmitting the conformational changes in 
periplasmic ligand-binding domains to cytoplasmic sig- 
naling kinase and methyl-acceptor domains [44]. 

These chemosensory transducer proteins work as two- 
component regulatory systems which typically consist of 
a sensory histidine kinase and a response regulator. The 
histidine kinase is usually a transmembrane receptor and 
the response regulator a cytoplasmic protein [45]. Fol- 
lowing autophosphorylation at a conserved histidine resi- 
due in response to changes in chemoreceptor occupancy, 
the histidine kinase serves as a phospho-donor for the 
response regulator. Once phosphorylated, the response 
regulator mediates changes in gene expression or cell 
motility. CheA is a typical sensory histidine kinase while 
CheY is a downstream regulator protein [46]. Upon 
phosphorylation, CheY binds to the FliM component at 
the base of the flagellar motor switch to induce clockwise 
rotation [47]. In contrast to the single copies of CheA 
and CheY in E. coli, the presence of 22 chemosensory 
transducer proteins, many with multiple copies including 
three CheA, one CheB, one CheD, two CheR, five CheV, 
one CheW, four CheY, and two CheZ, suggested that 
L. hongkongensis may utilize a complex transducer system 
to mediate chemotaxis response and adapt to environ- 
mental changes (Table 4). These Che proteins were 
encoded in three gene clusters, named CA, CB and CC. 
The first and largest cluster, CA, encoded two CheA, one 



CheR, two CheY, two CheV, one CheZ, and the single 
CheD and CheW. The second and smallest cluster, CB, 
encoded one CheV and CheY. The third cluster, CC, 
encoded one CheA, one CheY, two CheV and one CheZ. 
Phylogenetic analysis of CheAs, CheVs and CheYs of L. 
hongkongensis suggested that the multiple copies are the 
result of both horizontal transfer events and gene dupli- 
cation, as some of the copies were more closely related to 
the corresponding proteins in other bacteria while others 
were more closely related among the homologues of L. 
hongkongensis (Figure 3). 

The CheA proteins of L. hongkongensis were most clo- 
sely related to homologues in the closely related Chro- 
mobacterium violaceum and Lutiella nitroferrum with 
47% to 72% amino acid identities. CheA has five 
domains, PI to P5 [46]. All the three CheA proteins in 
L. hongkongensis contained these conserved domains. In 
the PI domain, the invariant histidine residue, which 
undergoes phosphorylation by the P4 domain, was also 
present. In the kinase domain P4, the four conserved 
regions designated the N, Gl, F and G2 boxes were also 
present in the three CheAs (Figure 4). 

The CheY proteins of L. hongkongensis were highly simi- 
lar to the homologues in C. violaceum and Dechloromonas 
aromatica, with 70% to 83% amino acid identities. Multi- 
ple alignment of the four CheY with that of E. coli showed 
the presence of all five amino acid residues conserved 
among response regulators [46,48]: aspartate at positions 
12, 13 and 57; threonine at position 87, and lysine at posi- 
tion 109, with the aspartate at position 57 representing the 
phosphorylation site (Figure 5). Residues that interact with 
P2 domain of CheA were identified. 

Other Che proteins are believed to be involved in the 
regulation of bacterial chemotaxis, although the exact 
function of some are not fully understood. Among 
them, CheB is known to work in conjunction with CheR 
in the reversible methylation of the MCPs. CheR is a 



Lau et al. Cell & Bioscience 2011, 1:28 Page 1 0 of 1 7 

http://www.cellandbioscience.eom/content/1 1\ /28 



CheA 



h 



0.05 



1000 



1000 



1000 



1000 



-C. violaceum CheA2 (AAQ58688) 

C. violaceum CheAl (AAQ60181) 

— L. hongkongensis CheAl (LHK_00554) 

L. hongkongensis CheA3 (LHK_02433) 

C. violaceum CheA (AAQ61 111) 

L. nitroferrum (ZP_03699451) 

Azoorcus sp. (YP_932966) 



978 



-E. coli (NP_754195) 

C. violaceum CheA (AAQ61 103) 

L. hongkongensis CheAl (LHK_00562) 



CheV 



CheY 



100 



100 



4" 



100 



Z. hongkongensis CheV3 (LHK_01360) 
Z. hongkongensis CheVl (LHK_00557) 
Z. hongkongensis CheV5 (LHK_02430) 
C. violaceum CheVl (AAQ61108) 
-Z. nitroferrum copy 1 (ZP_03699454) 

— Z). aromatica (YP_283961) 

— Thauera sp. (YP_002890196) 

— Azoarcus sp. copy 1 (YP 932963) 
— Limnobacter sp. (ZPO 19 16708) 



97 



100 



100 



-Z. nitroferrum copy 2 (ZP_03699455) 
-C. violaceum CheV2 (AAQ61107) 

C. violaceum CheV3 (AAQ60701) 

Z. hongkongensis CheV2 (LHK_00558) 
Z. hongkongensis CheV4 (LHK_02429) 
Azoarcus sp. copy 2 (YP_932962) 
— Thauera sp. copy 2 (YP_002890197) 



985 



829 



980 
1000 



1000 



-C. violaceum (AAQ60557) 
Azoarcus sp. (YP_932964) 
7%at/era sp. (YP 002890195) 
Pectobacterium atrosepticum (YP_049795) 

£. co/i (NP_754189) 
Methylobacittus flagellants (YP 54603 7) 
Z. nitroferrum (ZP 03699453) 

C. violaceum CheY3 (AAQ61109) 
Z. hongkongensis CheYl (LHK_00556) 
,Z. hongkongensis CheY4 (LHK_02431) 
'L^. hongkongensis CheY3 (LHK_01359) 



995 



999 



559 _ 
In" 



-C. violaceum CheYl (AAQ60183) 
C. violaceum CheY2 (AAQ58690) 

— Z// iobaci Hits denitrificans ( YP 3 15383) 
— Z>. aromatica (YP_283951) 

— C. violaceum CheY4 (AAQ61104) 

Z. hongkongensis CheYl (LHK_00561) 



Figure 3 Phylogenetic tree showing the relationships of the CheAs, CheVs and CheYs from L hongkongensis to those from other 
bacteria. The unrooted trees are constructed by using the neighbor-joining method using Kimura's two-parameter correction, with bootstrap 
values calculated from 1000 trees. The scale bar indicates the estimated number of substitutions per 20 bases. Bacterial names and accession 
numbers are given as cited in the GenBank database. 
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P1 domain 



Eco_cheA MHHRSVFPQC HQPNRGDSVS MDISDFYQTF FDEADELLAD MEQHLLVLQP -EAPDAEQLN 

LHK_00554 MSDF AGMEELLQDF LTESSELLSD VDNKLVELEK -YPEDKGLLN 

LHK_00562 MS IDLSQFHAAF FDEAAEHLET IERLLLESQP GQTTDGEALN 

LHK 02433 MSDF NGMEELLQDF LTESSELLSD VDNKLVELEK -YPEDKGLLN 



P1 domain 

Eco_cheA AIFRAAiSIK GGAGTFGFSV LQETTHLMEN LLDEARRGEM QLNTDIINLF LETKDIMQEQ 119 

LHK_00554 DIFRGFmTIK GGAGFLNATP LVTLCHRTEN LFDKLRNGEL KLNSHVMDVI LDATGVVRDM 103 

LHK_00562 AIFRAAiSIK GSAGTFGFAD IAGFTHGLEN LLDRIRRGEL PLTAGRISVC LKSRDVIADQ 102 

LHK_02433 DIFRGFfTIK GGAGFLNATP LVTLCHRTEN LFDKLRNGEL QLNSHVMDVI LDATGVVRDM 103 



| P1 domain | 

Eco_cheA LDAYKQ-SQE PDAASFNYIC QALRQLALEA KGETPSAVTR LS VVA KSEPQDEQSR 173 

LHK_00554 FGDLAQSRMP APAP 117 

LHK_00562 LAAHRDGATA VDPGRLAEIE AALLAAHDEG T-DMPVPVTS SRPDGGHYLL CWDRQLGHPE 161 

LHK 02433 FGDLAQSRMP APAP 117 



Eco_cheA 
LHK_00554 
LHK_00562 
LHK 02433 



P2 domain 



SQSPRRIILS RLKAGEVDLL EEELGHLTTL TDVVKGADSL SAILP -DDIAEDDIT 

QHILDNL DAVLSGQTAP VAAAPAAAAS QPGEPDWQQL YQAVVVPADA 

GLQDRLAALG TVQP-CGEQG EELVFQLQSG LSATDIAESL AFWLPPGQFR LEHLESQGDE 
QHILDNL DAMLAGQDMV RTVPGLPAVP DP ASVSQPASA 



227 
164 
220 
155 



P2 domain 

Eco_cheA AVLCFVIEAD QITFETVD-- VSPKISTPPM LKLAAEQAPT GRVEREKTTR 275 

LHK_00554 AAQLTSKSAP STAVAAAP-- APVPA ATPAASSPPP AQSRDRSPAS 207 

LHK_00562 AFGVFVDTQR WATS ATE PA APVDEGWGLF APPAAATADV SPPVAEPAAT GSTPALRQAT 280 

LHK_02433 PP AAS TVAVQAGP-- ATTAA AQPPE RERNISG 185 



| P3 domain | 

Eco_cheA SSESTSIRVA VEKVDQLINL VGELVITQSM LAQR S SELDPVNHGD LITSMGQLQR 330 

LHK_00554 SPQEATIRID TQRLDQVLNL SGEIGLTKNR LTTIRTEIMQ GNLGANTLRS LDEAVSQLDL 267 

LHK_00562 PQPETSIRVN VEKVDQLLNR IGELVITQSM LAQQ V ERLGTLASEE LQRGMAQLER 335 

LHK_02433 NQQETTLRVD AQRLDQVLNL SGEIGLTKNR LTTIRTEIIQ GNLGADVLHS LDEAVSQLDL 245 



P3 domain 



P4 domain 



Eco_cheA NARDLQESVM SIRMMPMEYV FSRYPRLVRD LAGKLGKQVE LTLVGSSTEL DKSLIERIID 390 

LHK_00554 LVGDLQNAVM KTRMQPIGRL FQKYPRLARD LARQLGKEVE LVLSGEETEL DKTMIEDLND 327 

LHK_00562 TTRELQEAVM SVRMLPVASV FGRFPRLVRE LGQKLGKAVE LQVIGEQTEI DKSFVEKLTD 395 

LHK 02433 LVGDLQNAVM KTRMQPIGRL FQKYPRLARD LARQLGKDVD LVLSGEETEL DKTMIEDLND 305 



P4 domain 



Eco_cheA ELTHLVRNSL DHG [ELPEKR LAAGKNSVGN LILSAEHQGG NIC1 EVTDDG AGlJ 

LHK_00554 E LVHLVRNAV DHGIETPEER LAAGKSAQSI VELSAQQVGD HI VI EVADDG RGM 

LHK_00562 ELTHLVRNSL DHGLESAEGR AGAGKPPVGR LTLRAFHQGG HIV1EVSDDG AGlJ 

LHK 02433 E LVHLVRNAV DHG [ETPDER QAAGKPVQSA VKLSAQQVGD HIV1 EVADDG RGM 



Eco_cheA 
LHK_00554 
LHK_00562 
LHK 02433 



Eco_cheA 
LHK_00554 
LHK_00562 
LHK 02433 



KAASQ GL TVSENMSDDE VAMI IFAPGF 

KALEKGLIDL EQANSLDDKQ ALHI IFLPGF 

KAREQ GL NVSDTMNDAE VWQIIFEPGF 

KALEKGLIDL EQANSMDDKQ ALHI IFLPGF 



STAEQVTDVS 
STKDQISSVS 
STAQAVTDVS 
TTKSEISSVS 



GRGVGM DVVK RNIQEMGGHV 
GRGVGM DVVR TNIQKLNGRI 
GRGVGM DVVR RNIEAMGGSI 
GRGVGM DVVR TNIQKLNGRI 

G2 



P4 domain 



P5 domain 



EIQSKQGTGT TIRILLPLTL AILDGMSVRV ADEVFILPLN AVMESLQPRE ADLHPLAGGE 

DISSAPNEGT KISISLPLTL AILPVLVVKV CNQPFAVPLA MVREIIPFSS DSIQEVSGRP 

RIESLAGVGT TISLHLPLTL AILDGMSIAI GNEIYILPLS QVVESLQPRA EDVFTLAGQH 

DIRSAPNEGT RISISLPLTL AILPVLVARV CDQSFAMPLA MVREIIPIAT DAIQEVSGRP 



507 
4 4 7 
512 
42 5 



567 
507 
5^2 

4 a 5 



I P5 domain | 

Eco_cheA RVLEVRGEYL PIVELWKVFN VAGAKTEATQ GIVVILQSGG RRYALLVDQL IGQHQVVVKN 627 
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P5 domain 



Eco_cheA LESNYRKVPG ISAATILGDG SVALIVDVSA LQAINREQRM ANTAA 672 

LHK_00554 LQN--IRPKG VAGATLSGDG AVVLVLDMED LLSNDRGDHK LARAGNMPML EAAAV 616 

LHK_00562 LETNYRRVRG LAAATILGDG QVAFILDGAE VVRMGQGG 670 

LHK_02433 LQN--VSLKG VAGATLSGDG AVVLVLDMDS LLNTDK— AH AAVHGALTPL PA 58 9 



Figure 4 Amino acid sequence alignments of L hongkongensis and E. coli CheAs. The conserved P1 to P5 domains are marked above the 
sequences. The histidine residue at potential phosphorylation site is shaded. The four conserved regions designated the N, G1, F and G2 boxes 
within P4 domain are marked in open boxes. 

V J 



constitutively active methyltransferase which methylates 
the conserved glutamine residues of MCPs, while the 
methylesterase CheB is responsible for demethylation 
[49,50]. Similar to CheY, the CheB of L. hongkongensis 



also contained the five conserved amino residues of 
response regulators. In addition, three conserved resi- 
dues of the catalytic site, serine at position 164, histidine 
at position 190 and aspartate at position 286, and the 
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Figure 5 Amino acid sequence alignments of L hongkongensis and E. coli CheYs. The conserved aspartate, threonine and lysine residues 

are shaded. The aspirate residue at potential phosphorylation site is marked by black square, and residues of E. coli CheY that interact with the 

P2 domain of E. coli Che A are marked by black triangles above the residues, 
k J 



GXGXXG nucleotide-binding-fold sequences conserved 
among CheB proteins were also present (Figure 6) [51]. 

Similar multiple copies of chemosensory transducer 
proteins have also been reported in C. violaceum and 
Rhodobacter sphaeroides [46,48]. Interestingly, the orga- 
nization of the first cluster in L. hongkongensis, CA, was 
similar to one of the three clusters, cluster 3, in C. 



violaceum, although some of the genes were in opposite 
coding direction. In R. sphaeroides, it has been shown 
that some of the multiple copies of Che proteins are 
essential (e.g. CheA2) while others are not (e.g. CheAl) 
although the multiple chemosensory protein homolo 
gues are not redundant [46,52]. Further studies are 
required to investigate the differential function of the 
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Figure 6 Amino acid sequence alignment of L hongkongensis and E. coli CheBs. The 5 conserved aspartate, threonine and lysine residues 
also found in CheY are shaded. The three conserved residues of the catalytic site Ser164, Hisl 90 and Asp286 in E. coli CheB are marked by 
triangles above the residues and the GXGXXG nucleotide-binding-fold consensus sequences of other CheB marked in open box. 



Lau et al. Cell & Bioscience 201 1, 1:28 
http://www.cellandbioscience.eom/content/1/1/28 



Page 13 of 17 



multiple copies of chemosensory transducer proteins in 
L. hongkongensis. 

Flagellar proteins in L hongkongensis 

A total of 40 CDSs, arranged in six gene clusters, were 
likely involved in the biosynthesis of flagella in L. hon- 
gkongensis (Table 5). These six clusters, FA, FB, FC, FD, 



FE and FF, encoded 11, 3, 5, 2, 16 and 3 genes respec- 
tively. The organization and gene contents of the first 
five clusters were highly similar to five of the seven clus- 
ters of flagellar genes (clusters 1, 2, 4, 5 and 7) previously 
found in C. violaceum [48], which is also a motile bacter- 
ium found in multiple ecosystems, including water and 



Table 5 CDSs involved in flagella biosynthesis in L hongkongensis genome 



CDS 


Gene 


Product 


Organism with th6 
closest matching 
sequences 


E-value 


Idpntitip^ 


Cluster 0 


LHK_00436 


flgL 


flagellar hook-associated protein 3 


L. nitroferrum 


1e-59 


127/312 (40%) 


FA 


LHK_00437 


flgK 


flagellar hook-associated protein FlgK 


L nitroferrum 


1 e-1 09 


258/634 (40%) 


FA 


LHK_00438 


flgj 


flagellar rod assembly protein/muramidase FlgJ 


L nitroferrum 


3e-68 


144/296 (48%) 


FA 


LHK_00439 


flgl 


flagellar basal body P-ring protein 


C violoceum 


1e-95 


197/294 (67%) 


FA 


LHK_00440 


fight 


flagellar L-ring protein 


L. nitroferrum 


4e-60 


122/231 (52%) 


FA 


LHK_00441 


flgG 


flagellar basal-body rod protein FlgG 


Ralstonio pickettii 


2e-92 


162/260 (62%) 


FA 


LHK_00442 


flgF 


flagellar basal-body rod protein FlgF 


L nitroferrum 


1e-75 


143/246 (58%) 


FA 


LHK_00443 


flgE 


flagellar basal body FlaE domain-containing protein 


Pseudomonas putido 


4e-76 


212/598 (35%) 


FA 


LHK_00444 


flgD 


flagellar hook capping protein 


L nitroferrum 


8e-38 


88/240 (36%) 


FA 


LHK_00445 


flgC 


flagellar basal-body rod protein flgC 


C. violaceum 


2e-49 


92/136 (67%) 


FA 


LHK_00446 


flgB 


flagellar basal-body rod protein FlgB 


L nitroferrum 


2e-41 


89/136 (65%) 


FA 


LHK_00584 


flgN 


FlgN family protein 


C. violoceum 


2e-15 


48/131 (36%) 


FB 


LHK_00585 


flgM 


anti-sigma-28 factor, FlgM 


L nitroferrum 


4e-09 


36/59 (61%) 


FB 


LHK_00586 


flgA 


flagella basal body P-ring formation protein FlgA 


L nitroferrum 


2e-36 


85/206 (41%) 


FB 


LHK_00781 


fliA 


RNA polymerase sigma factor for flagellar operon 


C. violaceum 


5e-89 


165/242 (68%) 


FC 


LHK_00782 


fleN 


flagellar synthesis regulator FleN 


L. nitroferrum 


3e-49 


121/268 (45%) 


FC 


LHK_00783 


flhF 


flagellar biosynthesis regulator FlhF 


C. violaceum 


1 e-1 1 9 


250/504 (49%) 


FC 


LHK_00784 


flhA 


flagellar biosynthesis protein FlhA 


L nitroferrum 


0 


519/682 (76%) 


FC 


LHK_00785 


flhB 


flagellar biosynthetic protein FlhB 


L. nitroferrum 


2e-136 


226/378 (59%) 


FC 


LHK_02206 


motB 


OmpA/MotB domain protein 


L. nitroferrum 


6e-1 1 1 


206/273 (75%) 


FD 


LHK_02207 


mot A 


flagellar motor protein MotA 


L nitroferrum 


9e-123 


213/286 (74%) 


FD 


LHK_02348 


fliR 


flagellar biosynthetic protein FliR 


L nitroferrum 


1e-60 


142/258 (55%) 


FE 


LHK_02349 


fliQ 


flagellar biosynthetic protein FliQ 


L nitroferrum 


6e-24 


65/89 (73%) 


FE 


LHK_02350 




GCN5-related N-acetyltransferase 


Methylocella silvestris 


5e-09 


47/150 (31%) 


FE 




fliP 


flagellar biosynthesis protein FliP 


C. violaceum 


7e-95 


170/1C1 (~I(~\Ci/\ 

\ /of Ida (/(J%J 


FE 


LHK_02352 


fliO 


flagellar protein FliO 


C. violaceum 


2e-16 


52/100 (52%) 


FE 


LHK_02353 


fliN 


flagellar motor switch protein FliN 


L nitroferrum 


2e-54 


111/140 (79%) 


FE 


LHK_02354 


fliM 


flagellar motor switch protein FliM 


L nitroferrum 


3e-160 


272/327 (83%) 


FE 


LHK_02355 


fill 


flagellar fliL transmembrane protein 


C. violaceum 


2e-28 


64/136 (47%) 


FE 


LHK_02356 


fliK 


flagellar hook-length control protein 


Nitrosomonas 
europaea 


5e-18 


41/108 (37%) 


FE 


LHK_02357 


flU 


flagellar export protein FlU 


L nitroferrum 


3e-20 


64/142 (45%) 


FE 


LHK_02358 


fill 


flagellar protein export ATPase Flil 


L nitroferrum 


0 


331/453 (73%) 


FE 


LHK_02359 


fliH 


flagellar assembly protein FliH 


L nitroferrum 


8e-32 


109/275 (39%) 


FE 


LHK_02360 


fliG 


flagellar motor switch protein FliG 


L nitroferrum 


2e-148 


261/332 (78%) 


FE 


LHK_02361 


fliF 


flagellar M-ring protein FliF 


L nitroferrum 


0 


339/585 (57%) 


FE 


LHK_02362 


fliE 


flagellar hook-basal body complex subunit FliE 


L nitroferrum 


5e-27 


69/110 (62%) 


FE 


LHK_02363 




two component, sigma54 specific, transcriptional regulator, Fis family 


L nitroferrum 


2e-143 


279/450 (62%) 


FE 


LHK_02703 


fliD 


flagellar hook-associated 2 domain protein 


L nitroferrum 


5e-45 


136/445 (30%) 


FF 


LHK_02704 


floG 


FlaG flagellar protein 


Janthinobacterium 
sp. Marseille 


2e-11 


38/105 (36%) 


FF 


LHK_02705 


fllC 


flagellin domain-containing protein 


Acidovorax sp. 


2e-73 


159/288 (55%) 


FF 



a The flagellar proteins were arranged in six gene clusters, FA, FB, FC, FD, FE and FF (flagellar A, B, C, D, E and F clusters) 
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soil On the other hand, the pathogenic Neisseria species, 
Neisseria gonorrhoeae and Neisseria meningitides, which 
also belong to the same Neisseriaceae family, are non- 
motile with humans being the only host and reservoir, 
and do not possess flagellar genes. 

A bacterial flagellum is typically composed of three 
parts, the filament formed by flagellin subunits, basal 
body attached to the bacterial cell membrane, and the 
hook which links between the filament and basal body 
[53]. All the major proteins that form these flagellar com- 
ponents were present in the L. hongkongensis genome. 
They included FliC and FliD which form the major part 
of the filament; FlgE, FlgK and FlgL which form the hook 
and hook-filament junction; and Fig B, FlgC, FlgH, Flgl, 
FlhA, FlhB, FliF, FUG, FliH, Flil, FliM, FUN, FliO, FliP, 
FliQ, FliR, MotA and MotB which form the basal body 
and flagellar-motor complex. Putative regulators of these 
flagellar proteins were also identified. FlgD and FliK are 
regulators of the hook component FlgE. FlgA, FlgN (both 
being chaperon proteins) and FliJ are involved in export 
of flagellar components. The anti-sigma factor gene FlgM 
and g28 FliA that regulates late gene products were also 
present. However, similar to C. violaceum, the L. hon- 
gkongensis genome lacked the FlhDC operon genes, sug- 
gesting that the regulation of flagellar protein expression 
is controlled by FlgM/FliA in this group of bacteria. 
Quorum sensing in L hongkongensis 

In addition to chemotaxis through which bacteria can 
rapidly adapt to environmental changes, quorum sensing 
is another way to assess the environment and to recognize 
the host. Quorum sensing is a signaling system through 
which bacteria can communicate among themselves by 
the production of and response to chemical signals called 
autoinducers [54]. In response to the changing concentra- 
tions of these autoinducers, downstream gene expression 
can be regulated. This cell-to-cell communication system, 
first identified in Vibrio harveyi in the regulation of biolu- 
minescence, is now known to exist in diverse bacteria, 
especially those that reside in the gastrointestinal tract 
where recognition of the host may be important for survi- 
val and virulence gene expression [54,55]. Among the 
three major quorum-sensing mechanisms, including the 
LuxR-I, LuxS/AI-2, and AI-3/epinephrine/norepinephrine 
systems, known to be utilized by enteric bacteria, only the 
latter was found in the L. hongkongensis genome, suggest- 
ing that this system played a major role in quorum-sensing 
in the bacterium [14]. 

The AI-3/epinephrine/norepinephrine system is 
involved in inter-kingdom cross-signaling and regulation 
of virulence gene transcription and motility [54]. This 
mechanism is best characterized in enterohemorrhagic 
E. coli (EHEC) which causes fatal hemorrhagic colitis and 
hemolytic uremic syndrome. It has been shown that the 
locus of enterocyte effacement (LEE), an important 



virulence factor in EHEC, and the flagellar genes of 
EHEC are regulated by the AI-3 system which involves 
AI-3 produced by the commensal gastrointestinal micro- 
flora and/or epinephrine/norepinephrine produced by 
the host [56,57]. The AI-3 system has also been impli- 
cated in biofilm formation in enteropathogenic E. coli 
(EPEC) [58]. Clarke et al. have recently identified the pro- 
tein, QseC that binds to AI-3 and epinephrine/norepi- 
nephrine, suggesting its involvement in the AI-3 system 
[59]. QseC belongs to a two-component system, QseB/C, 
in which QseC is the sensor kinase and QseB the 
response regulator. QseB/C has also been shown to be 
involved in activation of the flagella regulon and viru- 
lence in a rabbit model for EHEC [59,60]. The L. hon- 
gkongensis genome contained two sets of genes, 
LHK_00329/LHK_00328 and LHK_1812/LHK_1813, 
homologous to qseB/qseC [14], most closely related to 
homologues in C. violaceum and Azoarcus sp. strain 
BH72 respectively. The two qseB genes in L. hongkongen- 
sis possessed the response regulator receiver domain 
(PF00072) and the C-terminal domain of transcriptional 
regulatory protein (PF00486) previously found in the 
QseB of E. coli. The two qseC genes in L. hongkongensis 
also contained the His Kinase A (phosphoacceptor) 
domain (PF00512) and the histidine kinase-, DNA gyrase 
B-, and HSP90-like ATPase domain (PF02518) previously 
identified in the QseC of E. coli. The presence of two 
copies of qseB/qseC suggested that the AI-3 system may 
be an important mechanism for adaptation to the chan- 
ging environment and animal hosts for L. hongkongensis. 

Conclusions 

A large number of diverse transporters (n = 457), includ- 
ing those from all seven major transporter categories, 
were identified in the L. hongkongensis genome. A diver- 
sity of genes involved in chemotaxis, motility and 
quorum sensing were also found. This suggested that the 
ability to transport various substances plays an important 
role in the physiology or survival of L. hongkongensis, 
which may also utilize a complex system to mediate che- 
motaxis response and adapt to and survive in the rapidly 
changing environments. In particular, the bacterium is 
unique among closely related members of Neisseriaceae 
family in possessing higher number of proteins related to 
transport of ammonium, urea and dicarboxylate, which 
may reflect the importance of nitrogen and dicarboxylate 
metabolism in L. hongkogensis which is assacharolytic. 
Structural modeling of two C 4 - dicarboxylate transporters 
showed that they possessed similar structures to the 
determined structures of other DctP-TRAP transporters, 
but one with a rarely seen disulfide bond. A large number 
of ABC transporters were also identified. These suggest 
that the bacterium may be able to transport a wide vari- 
ety of substrates including antibiotics, dyes, detergents, 
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fatty acids, bile salts, organic solvents, ions, amino acids, 
drugs, heavy metals such as nickel and cobalt, nucleo- 
base, C 4 -dicarboxylates and other metabolites. Diverse 
mechanisms for iron transport, including hemin trans- 
porters for iron acquisition from host proteins, were 
identified, suggesting that the bacterium may adapt to 
iron limitation present in human host. Using blastp of all 
transporters against rcsb pdb, many of these genes were 
also found to have homolgous proteins of high sequence 
identities with known structures (data not shown). The 
large number of chemosensory transducer proteins, 
many having multiple copies arisen from both horizontal 
transfer events and gene duplications, may constitute a 
complex transducer system for mediating chemotaxis 
response and adapt to environmental changes. The pre- 
sence of two copies of qseB/qseC homologs suggests that 
L. hongkongensis may use the AI-3 system for cross-king- 
dom quorum-sensing and regulation of potential viru- 
lence factors. Further studies are required to better 
characterize the precise target substance for transport 
proteins of interest, and the targets regulated by qseB/ 
qseC in L. hongkongensis, which may shed light on its 
potential mechanisms for pathogenicity. Structural mod- 
eling can be a useful tool to provide useful structural 
insights about these genes in L. hongkongensis. 

Methods 

Transport genes were identified and classified according 
to Transport Classification Database TCDB http://www. 
tcdb.org/ and manual annotation. These CDSs were 
from COG C (Energy production and conversion), COG 
D (Cell cycle control, cell division, chromosome parti- 
tioning), COG E (Amino acid transport and metabo- 
lism), COG F (Nucleotide transport and metabolism), 
COG G (Carbohydrate transport and metabolism), COG 
H (Coenzyme transport and metabolism), COG I (Lipid 
transport and metabolism), COG J (Translation, riboso- 
mal structure and biogenesis), COG K (Transcription), 
COG L (Replication, recombination and repair), COG 
M (Cell wall/membrane/envelope biogenesis), COG N 
(Cell motility), COG O (post-translational modification, 
protein turnover, chaperones), COG P (Inorganic ion 
transport and metabolism), COG Q (Secondary metabo- 
lites biosynthesis, transport and catabolism), COG R 
(General function prediction only), COG S (Function 
unknown), COG T (Signal transduction mechanisms), 
COG U (Intracellular trafficking, secretion and vesicular 
transport) and COG V (Defense mechanisms). CDSs 
that were classified to COG N (cell motility) and COG 
T (signal transduction mechanisms), and COG M (cell 
wall/membrane/envelope biogenesis) were manually 
annotated for identification of genes related to chemo- 
taxis, motility and quorum sensing. CDSs from other 
COGs were searched for additional genes using 



keywords: chemotaxis, che, MCP, flagellar etc. All puta- 
tive genes were studied by manual curation based on 
the BLASTx result or multiple alignments. Phylogenetic 
relationships were determined using Clustal x version 
1.81. Protein family analysis was performed using PFAM 
[61]. Results were also compared to those of N. gonor- 
rhoeae, N. meningitidis, C. violaceum, which were the 
other bacterial species in the Neisseriaceae family with 
complete genome sequences available, where appropri- 
ate [29,62-70]. Genes encoding TRAP transporters were 
located and annotated as described above. Sequence 
analysis for the presence of signal peptide and trans- 
membrane domains were performed using SignalP v3.0 
and TMHMM v2.0 servers respectively [71,72], Identifi- 
cation of homologs in other bacteria was performed by 
using BLASTP sequence similarity search against the nr 
database in NCBI GenBank. The predicted sequences of 
mature SBPs were submitted to the I-TASSER server for 
homology modeling using default parameters and avail- 
able structures of several DctP-type SBP homologs (PDB 
code: 3B50, 2XA5, 3GYY, 3FXB, 2HPG, and 2CEY) as 
templates [73]. If multiple homology models were 
returned, then the best model was selected for further 
analysis based on the C-score. Quality assessment of the 
homology model was performed using PROCHECK [74] 
and ProSA-web [75]. Presence and connectivity of disul- 
fide bonds in the protein were predicted using the DiA- 
NNA vl.l server [76]. Structural alignment of the 
homology models of SBPs in L. hongkongensis and 
related structures in Protein Data Bank (http://www. 
pdb.org) was performed using the MatchMaker tool of 
UCSF Chimera with selected structures (PDB code: 
2HZK, 2CEY, 2 VPN, 2PFZ, 2PFY, and 2ZZV) [77]. 
Molecular images were generated using UCSF Chimera. 
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